NSF PAR Search | NSF Public Access Repository

Transformer-based approach for automated context-aware IFC-regulation semantic information alignment

https://doi.org/10.1016/j.autcon.2022.104540

Zhang, Ruichuan; El-Gohary, Nora (January 2023, Automation in Construction)

Full Text Available
Hierarchical Representation and Deep Learning–Based Method for Automatically Transforming Textual Building Codes into Semantic Computable Requirements

https://doi.org/10.1061/(ASCE)CP.1943-5487.0001014

Zhang, Ruichuan; El-Gohary, Nora (September 2022, Journal of Computing in Civil Engineering)

Full Text Available
Semantic Representation Learning and Information Integration of BIM and Regulations

https://doi.org/10.1061/9780784483893.058

Zhang, Ruichuan; El-Gohary, Nora (May 2022, Computing in Civil Engineering 2021)
Issa, R. (Ed.)
Automated checking of the compliance of building information modeling (BIM)-based building designs with relevant codes and regulations requires bridging the semantic gap between the Industry Foundation Classes (IFC) schema and natural language. In most existing automated compliance checking (ACC) systems, the integration of the IFC schema and natural language is realized through hardcoding or predefined rules, ontologies, or dictionaries. These methods require intensive manual engineering effort and are often rigid and difficult to generalize. There is, thus, a need for an information integration method that is automated yet flexible and generalizable. To address this need, this paper leverages transformer-based language models to learn the semantic representations of concepts in building information models (BIMs) and regulatory documents. An automated IFC-regulatory information integration approach based on these learned semantic representations is proposed. The preliminary experimental results show that the proposed approach achieved promising performance, with an accuracy of 80%, on integrating IFC and regulatory concepts.
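As a rough illustration of aligning concepts through learned representations, the sketch below matches each regulatory concept to its nearest IFC concept by cosine similarity. It is not the authors' implementation: the three-dimensional vectors are hypothetical stand-ins for transformer embeddings, the regulatory phrases are made up for the example, and nearest-neighbor matching is an assumed alignment rule.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def align(ifc_vectors, reg_vectors):
    """Map each regulatory concept to the most similar IFC concept."""
    return {
        reg: max(ifc_vectors, key=lambda ifc: cosine(vec, ifc_vectors[ifc]))
        for reg, vec in reg_vectors.items()
    }


# Hypothetical embeddings; a real system would obtain these from a
# transformer-based language model rather than hand-written numbers.
ifc_vectors = {
    "IfcWall": [0.9, 0.1, 0.0],
    "IfcDoor": [0.1, 0.9, 0.1],
    "IfcStair": [0.0, 0.2, 0.9],
}
reg_vectors = {
    "exterior wall": [0.8, 0.2, 0.1],
    "egress door": [0.2, 0.8, 0.0],
    "stairway": [0.1, 0.1, 0.95],
}

alignment = align(ifc_vectors, reg_vectors)
print(alignment)
```

An accuracy figure such as the one reported would then be the fraction of regulatory concepts whose predicted IFC concept matches a human-labeled one.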
Full Text Available
Natural language generation and deep learning for intelligent building codes

https://doi.org/10.1016/j.aei.2022.101557

Zhang, Ruichuan; El-Gohary, Nora (April 2022, Advanced Engineering Informatics)

Full Text Available
Building information modeling, natural language processing, and artificial intelligence for automated compliance checking

https://doi.org/10.4337/9781839105524.00022

Zhang, Ruichuan; El-Gohary, Nora (March 2022, Edward Elgar Publishing)

Full Text Available
A deep neural network-based method for deep information extraction using transfer learning strategies to support automated compliance checking

https://doi.org/10.1016/j.autcon.2021.103834

Zhang, Ruichuan; El-Gohary, Nora (December 2021, Automation in Construction)

Full Text Available
Clustering-Based Approach for Building Code Computability Analysis

https://doi.org/10.1061/(ASCE)CP.1943-5487.0000967

Zhang, Ruichuan; El-Gohary, Nora (November 2021, Journal of Computing in Civil Engineering)

Full Text Available
A Machine-Learning Approach for Semantically-Enriched Building-Code Sentence Generation for Automatic Semantic Analysis

https://doi.org/10.1061/9780784482865.133

Zhang, Ruichuan; El-Gohary, Nora (November 2020, Construction Research Congress 2020)
Tang, P.; Grau, D.; El Asmar, M. (Ed.)
Existing automated code checking (ACC) systems require the extraction of requirements from regulatory textual documents into computer-processable rule representations. The information extraction processes in those ACC systems are based on human interpretation, manual annotation, or predefined automated information extraction rules. Despite the high performance they have shown, rule-based information extraction approaches, by nature, lack sufficient scalability: the rules typically need some level of adaptation if the characteristics of the text change. Machine learning-based methods, instead of relying on hand-crafted rules, automatically capture the underlying patterns of the existing training text and generalize well to a variety of texts. A more scalable, machine learning-based approach is thus needed to achieve more robust performance across different types of codes/documents when automatically generating semantically-enriched building-code sentences for the purpose of ACC. To address this need, this paper proposes a machine learning-based approach for generating semantically-enriched building-code sentences, which are annotated syntactically and semantically, to support information extraction (IE). For improved robustness and scalability, the proposed approach uses transfer learning strategies to train deep neural network models on both general-domain and domain-specific data. The proposed approach consists of four steps: (1) data preparation and preprocessing; (2) development of a base deep neural network model for generating semantically-enriched building-code sentences; (3) model training using transfer learning strategies; and (4) model evaluation. The proposed approach was evaluated on a corpus of sentences from the 2009 International Building Code (IBC) and the Champaign 2015 IBC Amendments. The preliminary results show that the proposed approach achieved an optimal precision of 88%, recall of 86%, and F1-measure of 87%, indicating good performance.
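The reported F1-measure is consistent with the standard definition of F1 as the harmonic mean of precision and recall. The short check below assumes that standard definition; the function name is illustrative.

```python
def f1_measure(precision, recall):
    """Harmonic mean of precision and recall (standard F1 definition)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Precision and recall as reported in the abstract above.
f1 = f1_measure(0.88, 0.86)
print(round(f1, 2))  # 0.87
```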
Full Text Available
Unsupervised Machine Learning for Augmented Data Analytics of Building Codes

https://doi.org/10.1061/9780784482438.010

Zhang, Ruichuan; El-Gohary, Nora (June 2019, ASCE International Conference on Computing in Civil Engineering 2019)

Existing automated code checking methods/tools are unable to automatically analyze and represent all types of requirements (e.g., requirements that are too complex or that require human judgement). Recent efforts in the area of augmented data analytics have proposed the use of templates to facilitate the analysis of text. However, most of these efforts have constructed such templates manually, which is labor-intensive. More importantly, it is difficult for manually developed templates to capture the linguistic variations in building codes. More research is, thus, needed to automate the generation of templates to support the tagging and extraction of information from building codes. To address this need, this paper proposes an unsupervised machine learning-based method to extract sentence templates that describe syntactic and semantic features and patterns from building codes. The proposed method is composed of four main steps: (1) data preprocessing; (2) identifying the different groups of sentence fragments using clustering; (3) identifying the fixed parts and the slots in the templates based on the syntactic and semantic patterns of the sentence fragment groups; and (4) evaluating the extracted templates. The proposed method was implemented and tested on a corpus of text from the International Building Code. An accuracy of 0.76 was achieved.
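Step (3), separating fixed parts from slots, can be pictured with a deliberately simplified sketch. It assumes a cluster of fragments that already have the same number of tokens and treats any position where the tokens differ as a slot; the paper's method relies on syntactic and semantic patterns, which this toy version does not model. The fragments are invented for illustration and are not quoted from the International Building Code.

```python
def extract_template(fragments, slot="<SLOT>"):
    """Build a template from one group of fragments.

    A token position shared by every fragment is kept as a fixed part;
    a position where the fragments disagree becomes a slot.
    """
    tokenized = [fragment.split() for fragment in fragments]
    if len({len(tokens) for tokens in tokenized}) != 1:
        raise ValueError("this sketch assumes equal-length fragments")
    template = []
    for column in zip(*tokenized):
        template.append(column[0] if len(set(column)) == 1 else slot)
    return " ".join(template)


# Hypothetical cluster of similar sentence fragments.
group = [
    "shall be not less than 36 inches",
    "shall be not less than 44 inches",
    "shall be not less than 7 feet",
]

template = extract_template(group)
print(template)  # shall be not less than <SLOT> <SLOT>
```

Handling fragments of different lengths would need an alignment step before comparing positions, which is left out here.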
Full Text Available
A machine learning-based approach for building code requirement hierarchy extraction

Zhang, Ruichuan; El-Gohary, Nora (January 2019, 2019 CSCE Annual Conference)

Most of the existing automated code compliance checking (ACC) methods are unable to fully automatically convert complex building-code requirements into computer-processable forms. Such complex requirements usually have hierarchically complex clause and sentence structures. There is, thus, a need to decompose such complex requirements into hierarchies of much smaller, manageable requirement units that would be processable using most of the existing ACC methods. Rule-based methods have been used to deal with such complex requirements and have achieved high performance. However, they lack scalability, because the rules are developed manually and need to be updated and/or adapted when applied to a different type of building code. More research is, thus, needed to develop a scalable method to automatically convert the complex requirements into hierarchies of requirement units to facilitate the succeeding steps of ACC such as information extraction and compliance reasoning. To address this need, this paper proposes a new, machine learning-based method to automatically extract requirement hierarchies from building codes. The proposed method consists of five main steps: (1) data preparation and preprocessing; (2) data adaptation; (3) deep neural network model training for dependency parsing; (4) automated requirement segmentation and restriction interpretation based on the extracted dependencies; and (5) evaluation. The proposed method was trained using the English Treebank data and was tested on sentences from the 2009 International Building Code (IBC) and the Champaign 2015 IBC Amendments. The preliminary results show that the proposed method achieved an average normalized edit distance of 0.32, a precision of 89%, a recall of 76%, and an F1-measure of 82%, which indicates good requirement hierarchy extraction performance.
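For readers unfamiliar with the normalized edit distance metric, the sketch below computes a Levenshtein distance between a predicted and a reference sequence of requirement units and divides it by the longer sequence length. Both the normalization convention and the flat-sequence encoding of a hierarchy are assumptions made for illustration; the abstract does not specify how the authors define either, and the unit labels are hypothetical.

```python
def edit_distance(a, b):
    """Levenshtein distance between two sequences (insert, delete, substitute)."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        curr = [i]
        for j, y in enumerate(b, start=1):
            cost = 0 if x == y else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def normalized_edit_distance(a, b):
    """Edit distance divided by the longer length (assumed convention)."""
    longest = max(len(a), len(b))
    return edit_distance(a, b) / longest if longest else 0.0


# Hypothetical flattened hierarchies of requirement-unit labels.
predicted = ["subject", "restriction", "restriction", "exception"]
reference = ["subject", "restriction", "exception"]

score = normalized_edit_distance(predicted, reference)
print(score)  # 0.25
```

Under this convention a score of 0 means the predicted and reference sequences are identical, and lower values indicate closer agreement.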
Full Text Available